Intelligibility of machine translation output in speech synthesis
Abstract
One use of text-to-speech synthesis (TTS) is as a component of speech-to-speech translation systems. The output of automatic machine translation (MT) can vary widely in quality, however. A synthetic voice that is extremely intelligible on naturally-occurring text may be far less intelligible when asked to render text that is automatically generated. In this paper, we compare the quality of synthesis of naturally-occurring text and its MT counterpart. We find that intelligibility of TTS on MT output is significantly lower than on either naturally-occurring text or semantically unpredictable sentences, and explore the reasons why.
Similar resources
Integrating Machine Translation and Speech Synthesis Component for English to Dravidian Language Speech to Speech Translation System
This paper provides an interface between the machine translation and speech synthesis system for converting English speech to Tamil text in English to Tamil speech to speech translation system. The speech translation system consists of three modules: automatic speech recognition, machine translation and text to speech synthesis. Many procedures for incorporation of speech recognition and machin...
Improving speech synthesis of machine translation output
Speech synthesizers are optimized for fluent natural text. However, in a speech to speech translation system, they have to process machine translation output, which is often not fluent. Rendering machine translations as speech makes them even harder to understand than the synthesis of natural text. A speech synthesizer must deal with the disfluencies in translations in order to be comprehensibl...
Challenges in developing a Voice Input Voice Output Communication Aid for People with Severe Dysarthria
This paper describes the development of a voice-input voice-output communication aid (VIVOCA) for people with disordered or unintelligible speech, initially concentrating on people with severe dysarthria. The VIVOCA is intended to recognize and interpret an individual’s disordered speech and speak out an equivalent message in clear synthesized speech. User consultation suggests that such a devi...
2D Audiovisual Text-to-Speech Synthesis for Human-Machine Interaction in Dutch
Speech has always been the most important means of communication between humans. Therefore, using speech in machine-human communication can help in increasing the naturalness of the communication between a computer system and a user. Systems that can make a machine pronounce any given input text are referred to as text-to-speech systems. To further enhance the communication, a talking head can ...
Speech intelligibility after repair of cleft lip and palate
Background: Intelligibility refers to understandability of speech; and lack of it can negatively affect children’s overall communication effectiveness. Children with repaired cleft lip and/or cleft palate (CL/P) may experience poor speech intelligibility. This study aimed at evaluating speech intelligibility in children with repaired CL/P who had not been referred to sp...